Template Design for Information Extraction

نویسنده

  • Boyan A. Onyshkevych
چکیده

The design of the template for an infonnation extraction application (or exercise) reflects the nature of the task and therefore crucially affects the success of the attempt to capture infonnation from text. This paper addresses the template design requirement by discussing the general principles or desiderata of template design, object-oriented vs. flat template design, and template definition notation, all reflecting the results and lessons learned in the TIPSTERJMUC-5 template definition effort which is explicitly discussed in a Case Study in the last section of this paper.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Diversity of Scenarios in Information extraction

This paper discusses/presents problems of template structure for Information Extraction. We investigate these problems in the context of two new Information Extraction scenarios which are linguistically and structurally more challenging than the traditional MUC scenarios. By a scenario we mean a predefined set of facts to be extracted from text. Traditional views on event structure and template...

متن کامل

The LOLITA User-Definable Template Interface

The development of user-definable templates interfaces which allow the user to design new templates definitions in a user-friendly way is a new issue in the field of information extraction. The LOLITA user-definable templates interface allows the user to define new templates using sentences in natural language text with a few restrictions and formal elements. This approach is rather different f...

متن کامل

A Soft and Efficient Approach for Removal of Template from Mesoporous Silica using Benzene Sulfonamide

In this contribution, an effective and soft method for removal of template from nanochannels of mesoporous silica (MCM-41) is proposed. This method is based on chemically-modified solvent extraction which enhanced by means of an auxiliary organic compound, i.e. benzene sulfonamide. Template removal was performed in soft condition, i.e. in the presence of diluted sulfuric acid and at ambient tem...

متن کامل

Issues and Methodology for Template Design for Information Extraction

The goal of Information Extraction tasks is to identify, categorize, classify, relate, and normalize specific information of interest found in free text, and to make that information available to a back-end data base, data fusion, or other application. A data structure referred to as a template is typically used for capturing such information, particularly in cases where the amount and complexi...

متن کامل

Tasks, Domains, and Languages for Information Extraction

The information extraction tasks for the ARPA TIPSTER program center on automatically filling object-oriented data structures, called templates, with information extracted from free text in news stories (for discussion of templates and objects, see "Template Design for Information Extraction" in this volume). With text as input, the TIPSTER systems first detect whether the text contains relevan...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1993